home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Collection of Internet
/
Collection of Internet.iso
/
infosrvr
/
dev
/
www_talk.930
/
000296_davis@dri.cornell.edu _Wed Nov 4 22:37:10 1992.msg
< prev
next >
Wrap
Internet Message Format
|
1994-01-24
|
1KB
Return-Path: <davis@dri.cornell.edu>
Received: from dxmint.cern.ch by nxoc01.cern.ch (NeXT-1.0 (From Sendmail 5.52)/NeXT-2.0)
id AA20453; Wed, 4 Nov 92 22:37:10 MET
Received: by dxmint.cern.ch (dxcern) (5.57/3.14)
id AA10754; Wed, 4 Nov 92 22:49:28 +0100
Received: by willow.tc.cornell.edu id AA01760
(5.65c/IDA-1.4.4 for www-talk@nxoc01.cern.ch); Wed, 4 Nov 1992 16:48:34 -0500
Date: Wed, 4 Nov 1992 16:48:34 -0500
From: Jim Davis <davis@dri.cornell.edu>
Message-Id: <199211042148.AA01760@willow.tc.cornell.edu>
To: wei@xcf.berkeley.edu, www-talk@nxoc01.cern.ch
Subject: improved printing of WWW files
If you can't quite manage to live without hardcopy, you may
wish sometimes to print WWW files. I have written a couple
of scripts to do this. They are particularly useful with
Pei Wei's excellent Viola WWW browser.
A tar archive is available for anonymous FTP:
dri.cornell.edu/pub/davis/print-www.tar
It contains:
README
print-www
print-www.l
html-to-latex
html2latex.sed (modified version of original CERN version)
The hardest part was writing the perl script to obtain documents
via http protocol - turns out you cant just run pipes through telnet.
The conversion from HTML to LaTex is not really robust yet -
this is doubly hard since there is no guarentee that the HTML
is legal. But at least it works for my test cases. No doubt
it will be improved in time.
best wishes